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Abstract. From a suitable integral representation of the Laplace transform of a 
positive semi-definite quadratic form of independent real random variables with not 
necessarily identical densities a univariate integral representation is derived for the 
cumulative distribution function of the sample variance of i.i.d. random variables 
with a gamma density, supplementing former formulas of the author. Furthermore, 
from the above Laplace transform Fourier series are obtained for the density and the 
distribution function of the sample variance of i.i.d. random variables with a uniform 
distribution. This distribution can be applied e.g. to a statistical test for a scale 
parameter. 
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1. Introduction 

Exact results for the distribution of the sample variance from non-normal samples 
are hardly found. Early investigations, concerning from a sample of gamma random 
variables can be traced at least to Craig (1929) and Pearson (1929), who used moment 
approximations, later investigated more thoroughly by Bowman and Shenton (1983). 
For general distributions various approximations were proposed by Box (1953), Roy and 
Tiku (1962), Tan and Wong (1977) and by Mudholkar and Trivedi (1981). The latter 
authors recommend transformations of the Wilson-Hilferty type. For a mixture of two 
normal distributions see also chapter 4.7 in Mathai and Provost (1992). This monograph 
treats quadratic forms, mainly from normal random variables. For the cdf of a positive 
semi-definite quadratic form of normal r.v. a simple integral representation on [0, tt] is 
given by formula (31) in Royen (2007a) as the univariate special case of some multivariate 
distributions. The interest in the distribution of or 1/s was also stimulated by the 
estimation of process capability indices, see e.g. Pearn, Kotz and Johnson (1992) and 
Pearn and Kotz (2006). 

In this paper the cumulative distribution function (cdf) of (n — l)s^ = — 

X)'^ is derived for i.i.d. random variables Xj with a gamma density x°'~^e~^ /r{a) and 
for uniformly distributed Xj . The first distribution was already given by a double series 
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in Royen (2007b), but now a univariate integral representation of the cdf is derived in 
section 3 avoiding the recursive computation of the coefficients of the above series. The 

cited paper contains also the exact distribution of [ 'YTj=\ -^jj / -^j including 

Greenwood's statistic if a = 1. The distribution of (n — l)s^ from a sample of uniformly 
distributed random variables is obtained by a Fourier series in section 4 and applied to 
a statistical test for a scale parameter. 

Both these distributions are derived from a suitable integral representation of the 
Laplace transform (LT) of an " m-factorial" positive semi-definite quadratic form 

Q = X'{D - CC')X (1) 

of independent real random variables X' — (Xi, . . . , Xn) with not necessarily identical 
densities, with positive definite D = diag(di, . . . , (i„) and an n x m-matrix C = (cj>) of 
rank m with real or pure imaginary columns c^. 

In particular, with i.i.d. random variables and m = 1 the form Q is reduced to 
(n — l)s^ if D is the identity matrix and Cj = for all j. The LT of Q is given in 

the following section. 

Formulas from the handbook of mathematical functions by Abramowitz and Stegun 
are cited by A.S. and their number. The standard normal density is denoted by and 
8 means "expectation". 

2. The Laplace transform of an m-factorial positive semi-definite quadratic 
form 

The LT or Fourier transform (FT) of a sum of independent random variables can 
be written as a product of univariate integral transforms. To preserve this advantage 
for a quadratic form of the type in ([1]) as far as possible its LT is represented by an m- 
variate mixture over a product of univariate integral transforms. This is accomplished 
essentially by the equation 

/oo 
exp{yz)(p{y)dy = exp(zV2), z £ C. (2) 
-oo 

Inspite of its simplicity the result is stated formally as a theorem. 

Theorem 1. Let Q = X'{D — CC')X be a positive semide finite quadratic form 
of the type described in (QJ). Then the LT of Q is given by 

r, n mm 

£{exp{-tQ)) = / ( n f ( exp ( - td,X^ ± V2tX, ^ c,^y^))) • [] ^iyf^)dy^. (3) 
with the standard normal density ip,t>0, all dj > 0. 

Remark: Replacing t by —it in fS]) the characteristic function of Q is obtained if Q 
is definite or indefinite and absolutely bounded. 
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With i.i.d. random variables, m — I, dj = 1, Cj — l\/ri we get 
£ (exp(-tg)) = S (exp(-t(n - 1)5^) 

{£{eM-tX^±VW^Xy)))"ipiy)dy (4) 

— OO 
1 /"^ 

= ^ / {8{eM~{VtX - y/V^f))Ydy. 

\ i-K J-oo 

Proof of theorem 1 : Let be m = 1: 

n n 

£:(exp(-iQ)) = £:(cxp(-i^d,x2 + t(^CjXj)')) 

= £:(exp(-t^d,X2) / exp(±yV27^c,XjXj/)dy 

j=i 3=1 

applying ^ with z — ±^/2i^- CjXj. 

For real Cj the integrand is only positive. Thus, equation ^ is obtained by chang- 
ing the order of integration. For imaginary values Cj the integrand is majorized by 
exp{—tX' DX)(p{y) and equation ^ follows in the same way. The generalization to 
TO > 1 is obvious and needs no further considerations. 

3. The exact distribution of the sample variance from a gamma parent 
distribution 

We need a moment generating function and a "kernel" Kp: 

/>oo 

Mo,{z) := {V{a)y^ I exp(-a;2)x"^i exp(a;z)da; (5) 
Jo 

oo 

= (2r(a)ri^r((a + fc)/2)zVfc! 

fc=0 

a^ ^ ,a 1 ^/Qf + lx ^ /Qf + l 3 z^ . 



(2r(a))- FQ ,F,{-^tr-r)^V{:^)z,F,[ 



If a e N then 



2' T ^' ^ 2 ' ^ 2 ' 2' 4 

a > 0, z e C. 



M„(z) = (2r(a))-iV^(^)" ' (exp(zV4) erfc(-z/2)) . (6) 



Kp(r, z) + ^ Bek{z){-rf jkl (7) 

oo 

= E (r(/3 + A:/2))"' oFi(/3 + A:/2, -rV2)(-rz) Vfc!, 
/3 > 0, r > 0, z e C, 



where Hek(z) = {-!)'' ip'-'''' {z)/(p{z) = 2 '"/'^ Hk{z / \/2) with the Hermitc polynomials 
Hk. 

The equivalence of the two series in ([7]) follows by inversion of the LT 

r^^{z + = t-l^ exp(-z/VI) exp(-l/(2t)) (8) 

oo oo 

= exp(-l/(2i))^r''~'=/2(-2)'=/fc! = ^(-l)fei7efc(z)i-''-^-/Vfc! 

fe=0 fc=0 

with t > Q and any fixed z. The inversion provides the two corresponding series for 
x(^-^Kp{^,z). 

From ([7]) we obtain also an integral representation of Kp by well known functions. 
Let be 

oo 

= (r(/?))-i ^F^{l,p,y^) + {T{p + l/2))-^y /3 + 1/2, y^), y g C. 

After Kummer's transformation (A. S. 13. 1.27) we find with (A. S. 13. 6. 10) and 

Gp(z) = {T{P)r^ S^xf^-^e-^dx 

W0{y) = exp(y2)2/-2('3-i)(G0„i(y2) + Gp^,/2{y')), P > l- 
In particular with /3 = 1 + ?n G N we have 

2m-l 

w^My) = y-'"(exp(y2) erfc(-y)- J] yVr(l+j72)), 

^1/2(2/) = yexp(2/^) erfc(-y) + 7r"^/^ 
w^i/2+m(2/) = y^^{wm{y) - l/(m- 1)!). 
Thus, with any p > 0, we obtain 

Kp{r,z)ip{z) ^ — / ip{z + pe^'^)wfj{p'^re~'''^)dip. 

K)j will be used only with real z. In this case the real part of the integrand is integrated 
on [0, tt]. 

Furthermore, from the 2"'' series in ([7]) and the integral representation of the Bessel 
fmictions Jp-i+k/2 it follows 

K^ir, z) = (2/V^) fj^ ( (i /3 - i, i r'z^ sin^ ^)/r(/3 ~ 1) - 

(rz sin i9) oF2(^,/3, ^ r^z^ sin^ ,9)/r(/3)) (sin^ z?)/^-! cos(rN/2 cosi?)di?, /? > ^ ■ 

For the proof of the following theorem 2 we need some knowledge concerning the 
behaviour of the functions and Kp for large |z|, z real. From the integral in 
and the Tauber theorem for Laplace transforms of measures (e.g. Feller, chapter 13) it 
follows 

lim |z|"M„(z) = 1. (9) 
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With (A.S.9.1.62) and (A.S.9.1.69) we have |oFi(/3 + /e/2, -rV2)| < 1. Then, from 
hnife^^oo e^'''(r(/3 + fc/2))^^ = for aU e > 0, we obtain with the second series in ([7]) 

Kp{r, z) = o(e'''^l^l), e > 0, -oo. (10) 
With (A.S. 22. 14.17) we get \Hek{z)\/kl < l.le^'/'^/Vfc! and consequently 

OQ 

|i^^(r,z)| < l.le"'/*^(^(/3 + fc/2)^/M)"V^ z e R. (11) 

From the last representation in ([5]) and (A.S. 13. 1.14) it follows the asymptotic relation 

M,(z) ~ iV^/T{a)) exp(2V4)(z/2)"-\ z ^ (». (12) 

Finally, replacing t by —it in ((8]) we find with z > and > 1 by the Fourier inversion 
formula 

/"OO 

\x'^-^K0{^/^,z)\ < TT-^ r''^ exp{-z/V2i)dt (13) 
Jo 

= (2^/7r)r(2/3-2)/z2/3-2^ 



Theorem 2. Let Xi,...,X„ 6e i.i.d. random variables with a gamma density 
exp(— a;)/r(a). With the functions Ma from 0) and Kp from the cdf of Q — 
Z]"=i(-'^4 ^ ^ n>2, is given by 



Pr{Q < a;} = Kan/2+i{V^^.z){Ma{^2/^ z)Y^{z)dz. (14) 



Proof. Due to the relations . with j3 = an/2 + 1 the integral in (dH) is 
absolutely convergent for all a > and a; > 0. In particular, the integrand is at least 
0(z~") for z oo. Furthermore, the LT 



e-*"x""/2( / \Kan/2+l{^/^,z)\(^U^/2/^z)r^{z)dz)dx 

'J —OC 

exists for all t > 0. Thus, changing the order of integration, the LT of 



a:""/' / Kan/2+iiV^, z){Ma{VV^ z)y^{z)dz 



is given by 



r (^(z+v/^VM)/^W)(Ma(y2A^z))"^(z)dz. (15) 



On the other hand, applying equation ([?]) to a sample of i.i.d. random variables with 
density exp(— a;)/r(Q!), we find the LT of the density of Q. 



/oo I'OO „ 

( j exp(-.T2 + (V27^ 2/ - llVt)x)x"''^dx) if{y)dy 

poo poo „ 

= (r(a))-"t"""/2 / exp(-x2 + xz)x°'-^dxj ip{z + ^n/{2t))dz 



with the substitution z — y — ^nj {2t). Therefore t ^£{e coincides with (llSp which 
completes the proof. 

In the same way the more general formula 

/OO ^ ^ 

( ^) ( n (^27^ 2)) ¥^(2)^2, « = 51 

J=l j=l 

is proved for independent r(aj)-distributed random variables Xj. 

4. The exact distribution of the sample variance from a uniform parent 
distribution 

Let Xi, . . . , Xn be i.i.d. random variables, uniformly distributed on the interval 
[0,1]. The maximum Qmax of Q = Y!H=ii-^i ~ ■^)'^ given by n/A if n is even and 
by (n^ — l)/(4n) otherwise. These values are reached if Xi{l — Xi) =0, i — 1, . . . ,n, 
with exactly [n/2] solutions = 1 or [n/2] solutions Xi = 0. For a formal proof set 
Yi ~ 2Xi — 1 = sin$i. The coordinates of the stationary points of AQ = Y^^=i sin^ '^i — 
l/"(S"=i sii^'^'i)^ satisfy Yi = ±1 or Yi = Y. It is easy to show that the maximum 
is only attained if ^ Y does not occur. Then the result follows by maximizing 
Q = k(n — k) /n, fc = 1, . . . , n — 1. 

From 111) the LT of Q follows by straightforward calculation. Hence, the Fourier 
transform / of the density / of Q is given by 

f{t)^^{^/2r-\z/tr'^ (erf(y+-V^-erf(y--V^)) dy (16) 

and the Fourier sinus coefficients of f{x), Q < x < Qmax, are 

hk = (2/(9max)3m(/(tfc)), ^^^^ 

J Akir/n , n even [ 



tk 



|4fc7rn/(n^ — 1) ,71 odd 



However, for numerical reasons, any values of erf(z) with |3to(2)| > |5Re(z)| should be 
avoided within the integral f{t). Therefore, the path of integration is shifted by 1/2^— zi. 
With Cauchy's integral theorem for holomorphic functions and 

lim erf (y ± ^/—it) — erf(y) = 0, 
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uniformly on intervals < t < T, the integral 



J (eri{y + 2 - erf {y - -V-it) ] dy 



2 

in (fT6|l can be replaced by the identical value 







(erf (y + V^) - erf (y)) dy (18) 

1 /"^/l 1 \" 

+ -\/^ / (^erf(-\/^(u+ 1)) -erf(-V^(ti- du. 

J 

With the values from p7|) this leads to 



Theorem 3. Let &e independent random variables, uniformly dis- 

tributed on the interval [0, 1]. The cdf of Q — Y^^=ii-^i ^ "^)^ ^■^ given by 

F{x) = Pr{Q <x]^ iv^(^AF/2)"-3 . 



(/ ^^^m((z/<fe)"/2^crf(2/ + V^)-erf(y))")(l-cos(<fca;))/fcdy + (19) 

1 /•! °° , 1 1 

2 / E 3"i((Vife)^""'^/'(erf (2 V^(u + 1)) - erf(-V^(u - 1)))' 

fe=i 



„i 00 

2 

(1 — cos{tkx)) / k du 

{n/4 , n even I 

[n'^ - l)/(4n) , n odd J 

T/ie density f{x) is obtained by the derivative under the integral at least for n > 4. 

Proof. The formal series 
2 " 



Y,^Mf{tk)){l-cos{tkx))/k (20) 



TT 

fc=l 



for i^(x) follows by integration of the formal Fourier sinus series for the density /. Since 
F{x) is geometrically the intersection volume of the unit n-cube with an n-cylinder of 
radius r = whose axis passes (0, . . . , 0) and (1, . . . , 1), we have F{x) = Cl(a;("-i)/2) 
and f{x) — 0{x'^"-^^^/^) for x — + 0+. Besides, f{x) tends to zero for x Qmax- Hence, / 
is square integrable for all n > 3 and consequently the formula (|20p provides an absolutely 
convergent series representation of F{x). After the replacement described in (jlSp the 
resulting series differs from (|19p only by the change of summation and integration. Since 
erf(z) is absolutely bounded on {z e C| |5m(z)| < |9(?e(z)|} the change is justified for the 
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2nd integral in p9)l . The first integral in p9)) is divided into + and we get with 
y > 1 and t = tk > the estimation 

I (erf (y + ^/—it) — erf (y)) j \/ —it | 



|(-*t) 



-1/2 



exp(— z^)(iz| 



2 

< / I exp(— t/^ — 2uy\J—it + 

< ^= / exp(— y^ — \PR uy)du 

Jo 



= t-^/\l - exp{-V2i y))expi~y^)/y 

< t-i/2exp(-y2)/y. 

From this a majorant is obtained for the above integral and theorem 3 is proved. 



Remark. In all the plotted examples the integrands in exhibited a very smooth 
appearance. Since the integrand of the first integral in (|19p tends to zero very rapidly the 
upper limit of integration can be lowered to moderate values for a numerical integration. 
By Mir and Richards (1975) for r < l/\/2 the elementary formula 

Pr{Q < r^} = (V^ 7r("-i)/Vr((n + l)/2))r"-i - &(n)r", n>3 (21) 

was derived where 6(3) = 2\/6 and 6(n) is given by an integral over the (n — 3)-unit 
cube for n > 3. The authors have computed 6(n) exactly for n = 3, 4, 5 and additionally 
for n = 6, 7 by numerical integration. Their values 6(n) have been checked by means 
of p9p and further values 6(n) can be computed now by univariate integration, e.g. 
6(8) = 13.03951 . . ., 6(9) = 11.72490 . . ., 6(10) = 9.90095 .... 
The distribution of 

(Q - £(Q))(var(Q))-i/2 ^ (g _ _ l)/12)((n - l)(2n + 3)/(360n))^'^' 

tends to a standard normal distribution for n ~f oo. 

An example for the application of the distribution of Q is the following statistical 
test. Let Y be continuously distributed with a cdf F and the median m. To test F = Fq, 
where the specified cdf Fq has the same median, against 

Fo(y) < F(y), y < m and Foiy) > F{y), y > m, 

we can use the test statistic Q = Yl^=iiUi ~ with Ui = FQ{Yi), based on a random 
sample Yi, . . . , y„, and reject F = Fq for large values of Q. In particular, with F{y) = 
G{{y — m)/ fi) and Jb(y) = G((y — m)/ [3q), f3 > (3o > 0, where G denotes a continuous 
(not normal) cdf with median(G') — 0, this is a test for the scale parameter f3. It is 
supposed that the test works also if the medians of F and Fq differ only slightly, but the 
effect to the power is difficult to assess. The simpler test statistic X]"=i(^i ~ 1/2)^ with 
the exact cdf 

oo 

2-(n-i)^V2-i^<5^|^(-erf(^_i/fc7r/n)/A/-i/c7r/7i)")(l-cos(4fc7ra;/n))/fc, < x < n/4 
fc=i 

can also be used. 
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